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A wavelet transform algorithm is applied to image compression. It is observed 
that the algorithm does not suffer from the blockiness characteristic of the DCI- 
based algorithms at compression ratios exceeding 25:1, but the edges do not appear 
as sharp as they do with the latter method. Some suggestions for the improved 
performance of the wavelet transform method [ are presented. 


I. Introduction 

The application of wavelet transforms and multireso- 
lution analysis to data compression has attracted much 
attention recently. This circle of ideas is closely related to 
the subband compression and the pyramid encoding tech- 
niques. The general idea is to transform and reorganize 
the data in a hierarchical manner so that the upper levels 
of this hierarchy (or pyramid) represent the general fea- 
tures of the data or the image and the lower levels supply 
the details. Generally the higher levels of the pyramid are 
smaller data sets than the lower levels; however, the coeffi- 
cients in the latter portion are more correlated than those 
in the former and are better compressed by the standard 
lossless compression techniques. 

The applications of wavelet representations to practi- 
cal engineering problems are not limited to source coding. 
For example, one encounters situations that necessitate 
selecting a subset of a large data set on the basis of cer- 
tain characteristics. One may achieve this by browsing 
through the higher levels of the hierarchy, which comprise 
a much smaller data set, examining the general features of 


the data, and making judicious choices. The coefficients 
in the lower levels of the pyramid may be used for edge 
detection. 

As in other methods of data compression, applica- 
tions of wavelet transforms to source coding assume a 
priori knowledge of the tolerable level of information loss 
and/or the desirable compression ratio. Data compression 
is achieved by quantizing the transformed data and allo- 
cating bits to the different levels of the pyramid of the 
transformed data in a manner compatible with the con- 
straints and the requirements of the particular application. 
Naturally, in source coding applications more bits are al- 
located to an individual coefficient in the higher levels of 
the pyramid than to one in a lower level. In analogy with 
the discrete cosine transform (DCT), one may regard the 
lower levels of the pyramid as the high frequencies and the 
upper ones as the low frequencies. 

The presentation of the general theory of wavelet trans- 
forms in Section II is intended for application to data com- 
pression. The literature on the subject is often inadequate 
regarding the implementation of the basic ideas, and the 
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theoretical aspects of the subject seem to be only remotely 
reIatecTto practical engineering problems. It is hoped that 
the concise and concrete presentation of the wavelet trans- 
forms in Section II will make the literature more accessi- 
ble to interested researchers. In Section III, the practical 
aspects of image compression by wavelet transforms and 
the results of the applications are reported. Further re- 
search topics for the improvement of the performance of 
wavelet-transform-based compression algorithms are also 
suggested. Some of the advantages and disadvantages of 
the wavelet transforms versus the standard DCT tech- 
niques are discussed. However, no definitive judgment 
can be made at this time regarding their relative merits. 
While the latter approach has been studied extensively in 
the past decade, the application of wavelet transforms to 
image compression has not reached the level of maturity 
that would warrant definitive assessment of its merits and 
potential. 


fine the action of A at t on a function <p by A at t,(<p)(x) = 
- b)/a ]. It is convenient to introduce the nota- 
tion <Pm t n(x) = j42 m ,2 m n(v ? )(z)> for m and n integers, and 
note that a function / G £ admits of the expansion 


oo 

Pm(/)= Y a nXm,n (2) 

n= — OO 


where \ >s the indicator function of the interval [0,1), and 
= f Xm,n(x)f(x)dx. The functions \ m n are obtained 
from the single function \ through the action of a set of 
affine transformations of the line. For each fixed m, 


£ m = span {,Ym,n|n € Z} 

/ G C m n — ^ /( 2.) G -Cm-l 


( 3 ) 


II. Wavelet Transforms 

The idea of wavelet transforms and their applicability 
to signal analysis, and especially data compression, is most 
easily demonstrated by focusing on the one-dimensional 
case first. A straightforward generalization of the theory 
to two dimensions for application to image compression is 
indicated at the end of this section. In this case, a data 
set is represented by an element of C = L 7 (R). Consider 
the following sequence of partitions of R: 

OO 

Partition V m : R = (J 

n = — oo 

where I n>m = [2 m n,2 m (n + 1)), and let £ m be the sub- 
space of £ consisting of functions that are constant on the 
intervals I n rn . The operator p m of orthogonal projection 
on the subspace £ m is 

Pm (/)(*) = — f f{x)dx 
1 


From the expansion ( 2 ) one easily obtains the expansion 
of / following the decomposition £ = 0£ m . First observe 
that 


Xm+l,n — ^2 (Xm t 2n + Xm,2n+l) 


Therefore, a™* 1 — + a ™n+i)> an ^ after a simple 

calculation one obtains 

Pm(/) “ Pm + l(f) = 2 “ a 2n + l)(Xm,2n Xm t 2n + l) 

Now set <pm,n — ^72 (Xm,2n — Xm,2n+i) to obtain the ex- 
pansion 


f~ Y 6 n^m,n (4) 


where 7 m (x) is the unique interval I n>m ( m fixed) contain- 
ing x. The subspaces £ m have the following properties; 


£m + 1 Q £m ) £m — 0> £ m — £ (1) 

Let £ m denote the orthogonal complement of £ m+1 in £ m> 
then £ admits of the orthogonal direct sum decomposi- 
tion £ = ®£ m . Denote orthogonal projection on £ m by 
7 r m . Let A 0 ) t,, where a ^ 0 and b are real numbers, de- 
note the affine transformation , 4 a) &(x) = ax + 6, and de- 


where 6™ = ^(ag, - a£ +1 ). It is a remarkable fact, and 
easy to prove, that the functions are also obtained 

from the single function <p(x) = x(2x) - *(2x - 1) by 
the action of the set A - {A 2 m t 2 m n \m,n G Z} of affine 
transformations, and an analogue of condition (3) is valid 
for the subspaces £ m , namely, 


£ m - span {v? mi „|n G Z} 

/ G £ m <=> /( 2 .) G £m - 1 


( 5 ) 
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The functions {y?m,n} form a complete orthonormal set 
for £. The expansion (4) is an example of orthonormal 
wavelet expansion, and the coefficients b ™ are called the 
wavelet coefficients. 

To understand the intuitive meaning of the expansion 
(4), assume that / € £ m for a sufficiently large nega- 
tive number m. The projection of / on £ m is tt m(/) — 
/ - Pm+i(/). Now p m +i(/) is a slightly smoothed version 
of / so that 7 r m (/) represents the details that are miss- 
ing from the smoothed version p m +i(/)- Thus, it m (/) or 
more precisely, the coefficients 6™ in the expansion (4) be- 
long to the lowest level of the hierarchy. The process can 
be repeated with Pm+i(/) replacing /, thus leading to a 
hierarchy of the coefficients of the wavelet expansion of / . 

An important feature of the expansion (4) is that the 
coefficients b™ and a™ can be computed recursively in a 
simple manner. For example, to compute from a nJ 
substitute expansion (2) for m = 0 in the formula for a\ 
to obtain 

oo . 1 

fl n = E / - * " 2n Mx/2) dx (6) 

Therefore, if one defines a(k) as the integral in expansion 
(6) for n - 0, one obtains the formula 

oo 

a„ = ^2 a(k - 2 n)a° k (7) 

Jb=-oo 

Similarly, 


b l n = 53 P( k - 2n ) a ° 

fc = - oo 

where 0(1 fe) = -fe J x(x - k)ip(x/2)dx. By a straightfor- 
ward inductive extension of this calculation, one can ex- 
press 6™ +1 and a™ +1 in terms of a™. The resulting formu- 
lae are identical with formulae (7) and (8) with m and m+1 
replacing 0 and 1, respectively. Therefore, the wavelet co- 
efficients 6™ and a™ can be computed by the filters defined 
by a and /?. 

The orthonormal basis {<Pm.nl anc * the expansion (4) 
are just one example of an orthonormal wavelet expansion. 
To obtain other expansions, one has to abstract some of 
the features of this illustrative example. The essential in- 
gredients of the theory are an orthonormal doubly infinite 


basis {<pm.nl for C such that the functions <p m ,n are ob- 
tained from a single function <p via the action of the set 
*4, and for which condition (5) is valid. For applications, 
knowledge of the corresponding filters (3 and a is essen- 
tial. Since in practical engineering problems the data are 
normally in digital form, it is important to adapt the theo- 
retical framework of wavelets to the discrete or digital case 
before discussing other wavelet expansions. 

In the digital domain, /"(Z) replaces L 2 ( R) as the space 
of one-dimensional data. One can naturally identify / 2 (Z) 
with C o, and therefore the theory developed above extends 
to this case immediately. The only difference is 

C-j = Co, p-j=id., and tt_ ; - = 0 for j > 0 (9) 

It follows that formulae (2) through (8) remain valid, pro- 
vided that the range of the values of m is limited to 0 to oo. 
In practice, the domain of n is (Z mod 2^) for some in- 
teger N. Therefore, Cs = R> and the linear spaces C m 
are finite dimensional. The bases {Xn*} an ^ {Xri 1 > } 

for Cm = £m+ 1 0 differ by an orthogonal transforma- 
tion. It follows that the coefficients {a™} and {a™ +1 ,b™} 
are also related by an orthogonal transformation. This 
orthogonal transformation, which is the matrix represen- 
tation of the filters a and /?, is given by the 2 N x 2 N matrix 
with 2x2 diagonal blocks 

/ 1/\/2 l / x/2 \ 

Vw 2 -wv 

This means that given a data set represented by a 
column vector (/o, .... f2 N -i) tr > ^ ie application of the 
above matrix transforms it into a vector (go, • • , 92 N -\) tr 
with the even-numbered components (<7oi <72» • * ■ » 92 n -t) r 
representing pi(/) and the odd-numbered ones 
(g l} g 3} representing 7r x (f). Here the su- 

perscript tv means the transpose of the matrix or vector. 

The problem of determining other orthonormal wavelet 
expansions, and especially the corresponding filters, is dis- 
cussed in detail in [1]. Of particular interest in practical 
problems is the case where the functions a and (3 have 
small support, i.e., a(j) = 0 = /3(j) for most j’s. It is the 
knowledge of the functions (or filters) a and /?, and not 
the basis functions themselves, that is essential for appli- 
cations. In [1], the filters a and (3 of small support are 
explicitly determined. The simplest of these filters is the 
one given above. The next simplest one is the matrix T 
given by 
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(10) 

where 








<*o = 

l + v/3 

4y/2 

, Or 1 

_ 3 -f- \/3 
V2 ’ 

C*2 = 

3 — v/3 
4/2 ’ 

a 3 ~ 

1-/3 

4/2 


One should note that an important feature of the orthonor- 
mcd wavelet expansion is that the inversion procedure can 
be implemented by the transpose of the orthogonal matrix 
representing the filters a and /?, 

The above theory was limited to one-dimensional data* 
It can be easily adapted to the two-dimensional case by 
considering products of the basis functions considered in 
the one-dimensional case. This is equivalent to carrying 
out the one-dimensional wavelet transforms in the hori- 
zontal and vertical directions. The practical aspects of 
the two-dimensional wavelet transform are discussed in de- 
tail in the next section. Of course, there are orthonormal 
wavelet expansions that may not be separable, i.e., the ba- 
sis functions are not products of the basis functions for the 
one-dimensional case, but they will not be considered in 
this article. 


III. Application to Data Compression 

To apply the theory to data compression, one fixes an 
orthonormal wavelet expansion, or equivalently, the fil- 
ters a and (3, In the work ‘reported here, only the filter 
defined by the matrix J 7 was used. 1 An image is repre- 
sented by a matrix / = (fa), where fa is the intensity 
of the pixel (i, j). For each fixed row ?, one considers 
the transform g { = F// r , where /, is the ith row of the 
matrix /. The components of gi with even indices rep- 
resent pi(fi) and those with odd indices represent tt\ (/,*). 
It is convenient to reorganize the vector gi in the form 


1 It is often unclear from the literature what filter is actually used. 

The filter used in [3] differs from that defined by T . 


( 00 , 02 , ...» 02^— 2 > 01,03, 02 *-i)- Now the process 

is repeated for the columns of the matrix of the trans- 
formed rows. After reorganizing, the transformed matrix 
of pixel intensities takes the form 


where each g ij is a 2 isr ‘ 1 x 2^“ 1 matrix. Since an image is 
two-dimensional, the hierarchy of the wavelet coefficients 
requires some elaboration. The matrix g 11 represents the 
smoothed version of the image, while the remaining coeffi- 
cients are the missing details. The coefficients g 12 and g 21 
belong to the level of the pyramid immediately below g 11 , 
and g 22 lies at the lowest level of the pyramid. Thus, ev- 
ery application of the wavelet transform generates three 
levels of hierarchy for a two-dimensional image. The pro- 
cess is then repeated by applying the filters a and /? to 
the 2 Ar_1 x 2 jY_1 matrix g 11 along rows and columns. The 
resulting coefficients are then reorganized in the form 

9 UAl - {g 11 ’ 12 ^ 11 ’ 21 } — * g 11 ’ 22 - {g l 2 ,g n }^g 3 2 

with the highest level at the extreme left and the lowest at 
the extreme right. The process can be repeated. It may 
be more convenient to organize the coefficients differently 
in the following form: 

\ 

9 u - 22 

i 

* 22 

One then refers to y 11 ’ 11 as SS (smooth-smooth) level 2, to 
9 xl ' 12 ,g n ‘ 21 and g 12 ,g 21 as the SD (smpoth-detail) levels 
2 and 1, respectively, and to g 11 - 22 and g 22 as the DD 
(detail-detail) levels 2 and 1, respectively. 

In the application of wavelet transforms to image com- 
pression, the coefficients at different levels of the pyramid 
are not equally significant and, therefore, should be en- 
coded differently. The wavelet coefficients of different lev- 
els were examined for severed images, and certain patterns 
were observed. In general, the coefficients at a lower level 
of the pyramid are better approximated by a Laplacian 
density function than those at the higher levels. Using 
the nearest integer truncation, one also notices that the 
entropies of the coefficients at the lower levels are smaller 



/ 

9 xx ' 12 ,g xx 21 
1 

„12 „21 
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than those in the upper ones. Figures 1 through 4 show 
the distributions of the wavelet coefficients at different lev- 
els of the hierarchy for a typical image. An approximating 
Laplacian density function is given in Figs. 1 through 3. 
Clearly the coefficients at the highest level (Fig. 4) have a 
very irregular distribution. Table 1 shows the entropies of 
the wavelet coefficients for the same image. 

The image compression process is done by first comput- 
ing the coefficients $ 22 , </ 12 , flf 21 ^ 11 ’ 22 ,* 11 ’ 12 . ^c. These 
coefficients are quantized according to a bit allocation 
scheme similar to the one used for the standard DCT- 
based algorithms. As noted above, more bits are allocated 
to the higher levels of the hierarchy than to the lower ones. 
In the pictures of the peppers compressed by the wavelet 
transform method (Fig. 1), the coefficients in the lowest 
level have been set to 0. In practice it was observed that 
because of the quantization errors inherent in any floating- 
point computation, it is not desirable to go beyond three 
or four levels of wavelet transforms. To reconstruct the 
image, the inverse filter was applied to the coefficients. As 
noted above, the inverse filter is given by the transpose of 
the orthogonal matrix defining the filter. 

There are several issues involved in the application of 
wavelet transforms to image compression. The choice of 
the appropriate wavelet transform may be dictated by the 
complexity of the image. It has been suggested that dif- 
ferent transforms may be more appropriate for different 
images or even different parts of an image. Some ideas in 
this direction appear in [2] with apparently very promising 
results. The problem of bit allocation and quantization of 
the wavelet coefficients is similar to the analogous prob- 
lem for DCT-based image compression. It may be possi- 
ble to take advantage of the regularity of the coefficients 
at the lower levels of the pyramid and use the Laplacian 
distribution to allocate bits accordingly. However, the ex- 
perimental work carried out by the author suggests that 
the simpler method of truncation to the nearest integer 
followed by decimation by an appropriate number of bits 
provides better results. Naturally, fewer bits are allocated 
to the lower levels of the pyramid than to the upper lev- 
els. A different method for quantization is proposed in 
[3], These authors suggest that using the L 1 rather than 
the L? norm is more compatible with the human visual 
perception, and their proposed technique of quantization 


method is based on minimizing the errors in the former 
norm. 

While a definitive comparison between the DCT-based 
algorithms and wavelet transform techniques is premature, 
the tests done by the author suggest some important differ- 
ences. At higher compression ratios, for example at greater 
than 25:1, the blockiness in the DCT-based techniques be- 
comes very visible. ^Vith the wavelet transform used in the 
tests, the edges were not as clearly defined as those using 
the DCT-based techniques, but no blockiness was visible. 
The rms error of the Joint Photographic Experts Group 
(JPEG) DCT-based algorithm was smaller than that of 
the wavelet transform method, but visual preference is not 
necessarily reflected by the mean square error. Figure 5 
shows an original image (peppers) on the upper left cor- 
ner. The images on the upper right and lower left were 
compressed using the wavelet transform. The compres- 
sion ratios were 10:1 and 30:1, respectively. The image in 
the lower right was obtained by the application of the stan- 
dard DCT-based JPEG algorithm. Its compression ratio is 
30:1. The rms error for the lower left image is about 11.0, 
and for the one at lower right it is approximately 7.2, even 
though the blockiness makes it much worse than the one 
at lower left. The rms error for the image on the upper 
right is about 9.4. It should be pointed out that other 
methods, such as fractal algorithms, may produce images 
that are visually preferable to the DCT-based methods for 
high compression ratios. 

The wavelet transform used in this work is the prod- 
uct of a one-dimensional algorithm with itself; that is, es- 
sentially separable into one-dimensional algorithms. It is 
possible to modify this method to make the horizontal and 
vertical directions more coupled so that the algorithm be- 
comes truly two-dimensional. The visual effects of such 
modification are unclear at this time. However, it is rea- 
sonable to expect improvements in the clarity of the edges 
if such techniques are properly employed. 

IV. Conclusion 

The wavelet transform method provides a new approach 
to image compression. Although this approach has not 
performed as well as the DCT-based algorithms in terms of 
the rms error, it appears to have certain visual advantages 
especially regarding blockiness. 
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Table 1. Entropies of the 
wavelet coefficients. 


Level 

Entropy 

DD-1 

2.5 

SD-1 

3.2 

DD-2 

2.8 

SD-2 

3.8 

DD-3 

3.4 

SD-3 

4.5 

DD-4 

4.0 

SD-4 

5.0 

SS-4 

6.4 
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Fig. 5. Imaga oomp,...lo„: (a| tb. original un =o»,pr....F Imaga; (b) comp,,..lor,,.,l» oM0:1 by ..v.I.I Mo.lorm , (e) con,pr...lob 
ratio of 30:1 by wavelet transform; and (d) compression ratio of 30:1 by DCT-based JPEG algorithm. 
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